High Availability Issues in DSM Systems: Research Opportunities
نویسنده
چکیده
shared memory multiprocessors, high availability This report documents a first-cut understanding of the HA issues in DSM systems. We discuss the general HA strategy, advocate for minimizing fault propagation, system reconfiguration time and performance degradation as the distinctive goals for the three stages that the system goes through after the occurrence of a fault till full recovery. We show the possibility of estimating the impact of a fault through hierarchical component dependency analysis. We point out that coherent protocols should be extended and transactions be made closed in order to detect the fault and maintain data integrity. In particular, we propose source-buffering to augment dirty data transfer protocol in preparing for possible data loss and corruption. N+1 stand-by system is suggested as the ultimate HA solution. Further research opportunities are discussed. This report skims through a broad range of issues, but it does not attempt to treat each of them in depth.
منابع مشابه
Evaluation of Energy Storage Technologies and Applications Pinpointing Renewable Energy Resources Intermittency Removal
Renewable energy sources (RES), especially wind power plants, have high priority of promotion in the energy policies worldwide. An increasing share of RES and distributed generation (DG), should, as has been assumed, provide improvement in reliability of electricity delivery to the customers. Paper presented here concentrates on electricity storage systems technologies and applications pinpoint...
متن کاملReplication of Checkpoints in Recoverable DSM Systems
This paper presents a new technique of recovery for object-based Distributed Shared Memory (DSM) systems. The new technique, integrated with a coherence protocol for atomic consistency model, offers high availability of shared objects in spite of multiple node and communication failures, introducing little overhead. It ensures fast recovery in case of multiple node failures and enables a DSM sy...
متن کاملEvaluation of Energy Storage Technologies and Applications Pinpointing Renewable Energy Resources Intermittency Removal
Renewable energy sources (RES), especially wind power plants, have high priority of promotion in the energy policies worldwide. An increasing share of RES and distributed generation (DG), should, as has been assumed, provide improvement in reliability of electricity delivery to the customers. Paper presented here concentrates on electricity storage systems technologies and applications pinpoint...
متن کاملDesign and Analysis of Highly Availbalbe and Scalable Coherence Protocols for Distributed Shared Memory Systems Using Stochastic Modeling
Larger size networks require DSM coherence protocols which scale well. Fault-tolerance in terms of high availability is required for data access and for uninterrupted DSM service since large-scale environments have a greater number of potentially malfunctioning components. We present a new class of coherence protocols for DSM systems whose instances o er highly available access to shared data a...
متن کاملAn Optimal Preventive Maintenance Model to Enhance Availability and Reliability of Flexible Manufacturing Systems
General preventive maintenance model for the components of a system, which improves the reliability to ‘as good as new,’ was used to optimize the maintenance cost. The cost function of a maintenance policy was minimized under given availability constraint. On the other hand, in order to ensure appropriate reliability and availability, the development of the optimal maintenanc...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2001